Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 55825 |
| Missing cells (%) | 19.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.2 MiB |
| Average record size in memory | 232.0 B |
Variable types
| Numeric | 17 |
|---|---|
| Categorical | 6 |
| DateTime | 1 |
| Unsupported | 4 |
| Text | 1 |
id is highly overall correlated with total_pymnt and 2 other fields | High correlation |
loan_amnt is highly overall correlated with funded_amnt and 4 other fields | High correlation |
funded_amnt is highly overall correlated with loan_amnt and 4 other fields | High correlation |
int_rate is highly overall correlated with int_rate3 | High correlation |
installment is highly overall correlated with loan_amnt and 4 other fields | High correlation |
delinq_2yrs is highly overall correlated with mths_since_last_delinq | High correlation |
mths_since_last_delinq is highly overall correlated with delinq_2yrs | High correlation |
open_acc is highly overall correlated with total_acc | High correlation |
total_acc is highly overall correlated with open_acc | High correlation |
out_prncp is highly overall correlated with loan_amnt and 3 other fields | High correlation |
total_pymnt is highly overall correlated with id and 5 other fields | High correlation |
total_rec_prncp is highly overall correlated with id and 2 other fields | High correlation |
total_rec_int is highly overall correlated with id and 5 other fields | High correlation |
int_rate3 is highly overall correlated with int_rate | High correlation |
term is highly overall correlated with out_prncp | High correlation |
loan_status is highly imbalanced (70.8%) | Imbalance |
term has 476 (4.8%) missing values | Missing |
int_rate has 476 (4.8%) missing values | Missing |
installment has 476 (4.8%) missing values | Missing |
emp_length has 881 (8.8%) missing values | Missing |
home_ownership has 476 (4.8%) missing values | Missing |
annual_inc has 476 (4.8%) missing values | Missing |
loan_status has 476 (4.8%) missing values | Missing |
purpose has 476 (4.8%) missing values | Missing |
dti has 476 (4.8%) missing values | Missing |
delinq_2yrs has 476 (4.8%) missing values | Missing |
earliest_cr_line has 476 (4.8%) missing values | Missing |
mths_since_last_delinq has 5900 (59.0%) missing values | Missing |
open_acc has 476 (4.8%) missing values | Missing |
revol_bal has 476 (4.8%) missing values | Missing |
total_acc has 476 (4.8%) missing values | Missing |
out_prncp has 476 (4.8%) missing values | Missing |
total_pymnt has 476 (4.8%) missing values | Missing |
total_rec_prncp has 476 (4.8%) missing values | Missing |
total_rec_int has 476 (4.8%) missing values | Missing |
wtd_loans has 10000 (100.0%) missing values | Missing |
interest_rate has 10000 (100.0%) missing values | Missing |
int_rate2 has 476 (4.8%) missing values | Missing |
num_rate has 10000 (100.0%) missing values | Missing |
numrate has 10000 (100.0%) missing values | Missing |
int_rate3 has 476 (4.8%) missing values | Missing |
id has unique values | Unique |
wtd_loans is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
interest_rate is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
num_rate is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
numrate is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
delinq_2yrs has 8025 (80.2%) zeros | Zeros |
out_prncp has 1169 (11.7%) zeros | Zeros |
Reproduction
| Analysis started | 2023-11-10 19:31:33.979559 |
|---|---|
| Analysis finished | 2023-11-10 19:32:31.388670 |
| Duration | 57.41 seconds |
| Software version | ydata-profiling vv4.6.1 |
| Download configuration | config.json |
id
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5143647.9 |
| Minimum | 571203 |
|---|---|
| Maximum | 10125066 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 571203 |
|---|---|
| 5-th percentile | 1223026.2 |
| Q1 | 2300882.5 |
| median | 5605038.5 |
| Q3 | 7435741 |
| 95-th percentile | 9705338.4 |
| Maximum | 10125066 |
| Range | 9553863 |
| Interquartile range (IQR) | 5134858.5 |
Descriptive statistics
| Standard deviation | 2827943.8 |
|---|---|
| Coefficient of variation (CV) | 0.54979344 |
| Kurtosis | -1.3032773 |
| Mean | 5143647.9 |
| Median Absolute Deviation (MAD) | 2418180 |
| Skewness | 0.034461233 |
| Sum | 5.1436479 × 1010 |
| Variance | 7.9972663 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 571203 | 1 | < 0.1% |
| 6888279 | 1 | < 0.1% |
| 6884941 | 1 | < 0.1% |
| 6885382 | 1 | < 0.1% |
| 6885826 | 1 | < 0.1% |
| 6886319 | 1 | < 0.1% |
| 6886848 | 1 | < 0.1% |
| 6887364 | 1 | < 0.1% |
| 6887824 | 1 | < 0.1% |
| 6888765 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| 571203 | 1 | |
| 641849 | 1 | |
| 694891 | 1 | |
| 734736 | 1 | |
| 784712 | 1 | |
| 807342 | 1 | |
| 843448 | 1 | |
| 880114 | 1 | |
| 974654 | 1 | |
| 999547 | 1 |
| Value | Count | Frequency (%) |
| 10125066 | 1 | |
| 10124808 | 1 | |
| 10123803 | 1 | |
| 10123620 | 1 | |
| 10123424 | 1 | |
| 10123100 | 1 | |
| 10122896 | 1 | |
| 10122772 | 1 | |
| 10122507 | 1 | |
| 10122303 | 1 |
loan_amnt
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 697 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14054.808 |
| Minimum | 1000 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 3300 |
| Q1 | 8000 |
| median | 12000 |
| Q3 | 19400 |
| 95-th percentile | 30000 |
| Maximum | 35000 |
| Range | 34000 |
| Interquartile range (IQR) | 11400 |
Descriptive statistics
| Standard deviation | 8108.6587 |
|---|---|
| Coefficient of variation (CV) | 0.57693133 |
| Kurtosis | -0.034504762 |
| Mean | 14054.808 |
| Median Absolute Deviation (MAD) | 5475 |
| Skewness | 0.75167769 |
| Sum | 1.4054808 × 108 |
| Variance | 65750346 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 720 | 7.2% |
| 15000 | 559 | 5.6% |
| 12000 | 546 | 5.5% |
| 20000 | 447 | 4.5% |
| 8000 | 334 | 3.3% |
| 6000 | 327 | 3.3% |
| 35000 | 304 | 3.0% |
| 16000 | 269 | 2.7% |
| 18000 | 264 | 2.6% |
| 5000 | 257 | 2.6% |
| Other values (687) | 5973 |
| Value | Count | Frequency (%) |
| 1000 | 30 | |
| 1100 | 2 | < 0.1% |
| 1150 | 1 | < 0.1% |
| 1200 | 22 | |
| 1225 | 1 | < 0.1% |
| 1325 | 1 | < 0.1% |
| 1350 | 1 | < 0.1% |
| 1400 | 8 | 0.1% |
| 1450 | 4 | < 0.1% |
| 1500 | 28 |
| Value | Count | Frequency (%) |
| 35000 | 304 | |
| 34975 | 1 | < 0.1% |
| 34800 | 1 | < 0.1% |
| 34500 | 1 | < 0.1% |
| 34475 | 3 | < 0.1% |
| 34350 | 1 | < 0.1% |
| 34000 | 9 | 0.1% |
| 33950 | 10 | 0.1% |
| 33600 | 4 | < 0.1% |
| 33425 | 12 | 0.1% |
funded_amnt
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 698 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14052.73 |
| Minimum | 1000 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 3300 |
| Q1 | 8000 |
| median | 12000 |
| Q3 | 19400 |
| 95-th percentile | 30000 |
| Maximum | 35000 |
| Range | 34000 |
| Interquartile range (IQR) | 11400 |
Descriptive statistics
| Standard deviation | 8107.6932 |
|---|---|
| Coefficient of variation (CV) | 0.57694791 |
| Kurtosis | -0.032890733 |
| Mean | 14052.73 |
| Median Absolute Deviation (MAD) | 5462.5 |
| Skewness | 0.75225334 |
| Sum | 1.405273 × 108 |
| Variance | 65734690 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 720 | 7.2% |
| 15000 | 559 | 5.6% |
| 12000 | 545 | 5.5% |
| 20000 | 447 | 4.5% |
| 8000 | 334 | 3.3% |
| 6000 | 327 | 3.3% |
| 35000 | 304 | 3.0% |
| 16000 | 269 | 2.7% |
| 18000 | 265 | 2.6% |
| 5000 | 257 | 2.6% |
| Other values (688) | 5973 |
| Value | Count | Frequency (%) |
| 1000 | 30 | |
| 1100 | 2 | < 0.1% |
| 1150 | 1 | < 0.1% |
| 1200 | 22 | |
| 1225 | 1 | < 0.1% |
| 1325 | 1 | < 0.1% |
| 1350 | 1 | < 0.1% |
| 1400 | 8 | 0.1% |
| 1450 | 4 | < 0.1% |
| 1500 | 28 |
| Value | Count | Frequency (%) |
| 35000 | 304 | |
| 34975 | 1 | < 0.1% |
| 34800 | 1 | < 0.1% |
| 34500 | 1 | < 0.1% |
| 34475 | 3 | < 0.1% |
| 34350 | 1 | < 0.1% |
| 34000 | 9 | 0.1% |
| 33950 | 10 | 0.1% |
| 33600 | 4 | < 0.1% |
| 33425 | 12 | 0.1% |
term
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Memory size | 78.2 KiB |
| 36 months | |
|---|---|
| 60 months |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 95240 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 60 months |
|---|---|
| 2nd row | 36 months |
| 3rd row | 60 months |
| 4th row | 36 months |
| 5th row | 36 months |
Common Values
| Value | Count | Frequency (%) |
| 36 months | 7269 | |
| 60 months | 2255 | 22.6% |
| (Missing) | 476 | 4.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| months | 9524 | |
| 36 | 7269 | |
| 60 | 2255 | 11.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 19048 | ||
| 6 | 9524 | |
| m | 9524 | |
| o | 9524 | |
| n | 9524 | |
| t | 9524 | |
| h | 9524 | |
| s | 9524 | |
| 3 | 7269 | 7.6% |
| 0 | 2255 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 57144 | |
| Space Separator | 19048 | 20.0% |
| Decimal Number | 19048 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 9524 | |
| o | 9524 | |
| n | 9524 | |
| t | 9524 | |
| h | 9524 | |
| s | 9524 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 9524 | |
| 3 | 7269 | |
| 0 | 2255 | 11.8% |
Space Separator
| Value | Count | Frequency (%) |
| 19048 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 57144 | |
| Common | 38096 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 9524 | |
| o | 9524 | |
| n | 9524 | |
| t | 9524 | |
| h | 9524 | |
| s | 9524 |
Common
| Value | Count | Frequency (%) |
| 19048 | ||
| 6 | 9524 | |
| 3 | 7269 | 19.1% |
| 0 | 2255 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 95240 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 19048 | ||
| 6 | 9524 | |
| m | 9524 | |
| o | 9524 | |
| n | 9524 | |
| t | 9524 | |
| h | 9524 | |
| s | 9524 | |
| 3 | 7269 | 7.6% |
| 0 | 2255 | 2.4% |
int_rate
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 134 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.277852 |
| Minimum | 6.03 |
|---|---|
| Maximum | 26.06 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 6.03 |
|---|---|
| 5-th percentile | 7.62 |
| Q1 | 11.14 |
| median | 14.09 |
| Q3 | 17.27 |
| 95-th percentile | 22.4 |
| Maximum | 26.06 |
| Range | 20.03 |
| Interquartile range (IQR) | 6.13 |
Descriptive statistics
| Standard deviation | 4.4301591 |
|---|---|
| Coefficient of variation (CV) | 0.31028191 |
| Kurtosis | -0.46512934 |
| Mean | 14.277852 |
| Median Absolute Deviation (MAD) | 3.1 |
| Skewness | 0.24772703 |
| Sum | 135982.26 |
| Variance | 19.62631 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.12 | 485 | 4.9% |
| 13.11 | 432 | 4.3% |
| 8.9 | 357 | 3.6% |
| 14.33 | 351 | 3.5% |
| 7.9 | 321 | 3.2% |
| 11.14 | 318 | 3.2% |
| 15.31 | 285 | 2.9% |
| 16.29 | 265 | 2.6% |
| 7.62 | 262 | 2.6% |
| 10.16 | 223 | 2.2% |
| Other values (124) | 6225 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 6.03 | 220 | |
| 6.62 | 184 | |
| 6.97 | 15 | 0.1% |
| 7.51 | 12 | 0.1% |
| 7.62 | 262 | |
| 7.9 | 321 | |
| 8.6 | 23 | 0.2% |
| 8.9 | 357 | |
| 9.25 | 26 | 0.3% |
| 9.67 | 75 | 0.8% |
| Value | Count | Frequency (%) |
| 26.06 | 6 | 0.1% |
| 25.99 | 3 | < 0.1% |
| 25.89 | 8 | |
| 25.83 | 9 | |
| 25.8 | 9 | |
| 25.57 | 8 | |
| 25.28 | 5 | 0.1% |
| 24.99 | 11 | |
| 24.89 | 15 | |
| 24.83 | 7 |
installment
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 5174 |
|---|---|
| Distinct (%) | 54.3% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 442.62661 |
| Minimum | 30.44 |
|---|---|
| Maximum | 1388.45 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 30.44 |
|---|---|
| 5-th percentile | 118.12 |
| Q1 | 266.575 |
| median | 398.51 |
| Q3 | 576.7375 |
| 95-th percentile | 920.5635 |
| Maximum | 1388.45 |
| Range | 1358.01 |
| Interquartile range (IQR) | 310.1625 |
Descriptive statistics
| Standard deviation | 244.52212 |
|---|---|
| Coefficient of variation (CV) | 0.55243429 |
| Kurtosis | 0.83526776 |
| Mean | 442.62661 |
| Median Absolute Deviation (MAD) | 149.515 |
| Skewness | 0.93200479 |
| Sum | 4215575.8 |
| Variance | 59791.065 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 332.72 | 40 | 0.4% |
| 399.26 | 36 | 0.4% |
| 499.08 | 34 | 0.3% |
| 337.47 | 34 | 0.3% |
| 665.44 | 29 | 0.3% |
| 328.06 | 29 | 0.3% |
| 404.97 | 25 | 0.2% |
| 343.39 | 25 | 0.2% |
| 412.06 | 25 | 0.2% |
| 635.07 | 23 | 0.2% |
| Other values (5164) | 9224 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 30.44 | 1 | |
| 31.17 | 1 | |
| 31.3 | 1 | |
| 32 | 1 | |
| 32.14 | 1 | |
| 32.35 | 1 | |
| 32.42 | 1 | |
| 32.81 | 1 | |
| 33.42 | 1 | |
| 33.75 | 1 |
| Value | Count | Frequency (%) |
| 1388.45 | 1 | |
| 1366.36 | 1 | |
| 1363.98 | 1 | |
| 1359.96 | 1 | |
| 1353.93 | 2 | |
| 1349.38 | 1 | |
| 1336.31 | 1 | |
| 1331.25 | 1 | |
| 1327.45 | 1 | |
| 1318.63 | 1 |
emp_length
Categorical
MISSING 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 881 |
| Missing (%) | 8.8% |
| Memory size | 78.2 KiB |
| 10+ years | |
|---|---|
| 2 years | |
| 5 years | |
| 3 years | |
| < 1 year | |
| Other values (6) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.6779252 |
| Min length | 6 |
Characters and Unicode
| Total characters | 70015 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10+ years |
|---|---|
| 2nd row | 10+ years |
| 3rd row | 2 years |
| 4th row | 3 years |
| 5th row | 2 years |
Common Values
| Value | Count | Frequency (%) |
| 10+ years | 3054 | |
| 2 years | 869 | 8.7% |
| 5 years | 753 | 7.5% |
| 3 years | 692 | 6.9% |
| < 1 year | 657 | 6.6% |
| 6 years | 618 | 6.2% |
| 1 year | 583 | 5.8% |
| 7 years | 558 | 5.6% |
| 4 years | 537 | 5.4% |
| 8 years | 449 | 4.5% |
| (Missing) | 881 | 8.8% |
Length
| Value | Count | Frequency (%) |
| years | 7879 | |
| 10 | 3054 | 16.2% |
| 1 | 1240 | 6.6% |
| year | 1240 | 6.6% |
| 2 | 869 | 4.6% |
| 5 | 753 | 4.0% |
| 3 | 692 | 3.7% |
| 657 | 3.5% | |
| 6 | 618 | 3.3% |
| 7 | 558 | 3.0% |
| Other values (3) | 1335 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9776 | ||
| y | 9119 | |
| e | 9119 | |
| a | 9119 | |
| r | 9119 | |
| s | 7879 | |
| 1 | 4294 | |
| 0 | 3054 | 4.4% |
| + | 3054 | 4.4% |
| 2 | 869 | 1.2% |
| Other values (8) | 4613 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44355 | |
| Decimal Number | 12173 | 17.4% |
| Space Separator | 9776 | 14.0% |
| Math Symbol | 3711 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4294 | |
| 0 | 3054 | |
| 2 | 869 | 7.1% |
| 5 | 753 | 6.2% |
| 3 | 692 | 5.7% |
| 6 | 618 | 5.1% |
| 7 | 558 | 4.6% |
| 4 | 537 | 4.4% |
| 8 | 449 | 3.7% |
| 9 | 349 | 2.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 9119 | |
| e | 9119 | |
| a | 9119 | |
| r | 9119 | |
| s | 7879 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3054 | |
| < | 657 | 17.7% |
Space Separator
| Value | Count | Frequency (%) |
| 9776 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44355 | |
| Common | 25660 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9776 | ||
| 1 | 4294 | |
| 0 | 3054 | 11.9% |
| + | 3054 | 11.9% |
| 2 | 869 | 3.4% |
| 5 | 753 | 2.9% |
| 3 | 692 | 2.7% |
| < | 657 | 2.6% |
| 6 | 618 | 2.4% |
| 7 | 558 | 2.2% |
| Other values (3) | 1335 | 5.2% |
Latin
| Value | Count | Frequency (%) |
| y | 9119 | |
| e | 9119 | |
| a | 9119 | |
| r | 9119 | |
| s | 7879 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70015 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9776 | ||
| y | 9119 | |
| e | 9119 | |
| a | 9119 | |
| r | 9119 | |
| s | 7879 | |
| 1 | 4294 | |
| 0 | 3054 | 4.4% |
| + | 3054 | 4.4% |
| 2 | 869 | 1.2% |
| Other values (8) | 4613 |
home_ownership
Categorical
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Memory size | 78.2 KiB |
| MORTGAGE | |
|---|---|
| RENT | |
| OWN | |
| OTHER | 1 |
| NONE | 1 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 5.9455061 |
| Min length | 3 |
Characters and Unicode
| Total characters | 56625 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MORTGAGE |
|---|---|
| 2nd row | MORTGAGE |
| 3rd row | MORTGAGE |
| 4th row | RENT |
| 5th row | RENT |
Common Values
| Value | Count | Frequency (%) |
| MORTGAGE | 4839 | |
| RENT | 3855 | |
| OWN | 828 | 8.3% |
| OTHER | 1 | < 0.1% |
| NONE | 1 | < 0.1% |
| (Missing) | 476 | 4.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| mortgage | 4839 | |
| rent | 3855 | |
| own | 828 | 8.7% |
| other | 1 | < 0.1% |
| none | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 9678 | |
| E | 8696 | |
| R | 8695 | |
| T | 8695 | |
| O | 5669 | |
| M | 4839 | |
| A | 4839 | |
| N | 4685 | |
| W | 828 | 1.5% |
| H | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 56625 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 9678 | |
| E | 8696 | |
| R | 8695 | |
| T | 8695 | |
| O | 5669 | |
| M | 4839 | |
| A | 4839 | |
| N | 4685 | |
| W | 828 | 1.5% |
| H | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56625 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 9678 | |
| E | 8696 | |
| R | 8695 | |
| T | 8695 | |
| O | 5669 | |
| M | 4839 | |
| A | 4839 | |
| N | 4685 | |
| W | 828 | 1.5% |
| H | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56625 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 9678 | |
| E | 8696 | |
| R | 8695 | |
| T | 8695 | |
| O | 5669 | |
| M | 4839 | |
| A | 4839 | |
| N | 4685 | |
| W | 828 | 1.5% |
| H | 1 | < 0.1% |
annual_inc
Real number (ℝ)
MISSING 
| Distinct | 1492 |
|---|---|
| Distinct (%) | 15.7% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71655.752 |
| Minimum | 7500 |
|---|---|
| Maximum | 1000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 7500 |
|---|---|
| 5-th percentile | 29000 |
| Q1 | 45000 |
| median | 61000 |
| Q3 | 86000 |
| 95-th percentile | 140850 |
| Maximum | 1000000 |
| Range | 992500 |
| Interquartile range (IQR) | 41000 |
Descriptive statistics
| Standard deviation | 45362.834 |
|---|---|
| Coefficient of variation (CV) | 0.6330662 |
| Kurtosis | 61.214623 |
| Mean | 71655.752 |
| Median Absolute Deviation (MAD) | 19000 |
| Skewness | 5.0068795 |
| Sum | 6.8244938 × 108 |
| Variance | 2.0577868 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50000 | 392 | 3.9% |
| 60000 | 361 | 3.6% |
| 40000 | 290 | 2.9% |
| 65000 | 285 | 2.9% |
| 70000 | 266 | 2.7% |
| 55000 | 252 | 2.5% |
| 45000 | 245 | 2.5% |
| 75000 | 237 | 2.4% |
| 80000 | 231 | 2.3% |
| 35000 | 191 | 1.9% |
| Other values (1482) | 6774 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 7500 | 1 | < 0.1% |
| 8400 | 1 | < 0.1% |
| 8832 | 1 | < 0.1% |
| 10000 | 1 | < 0.1% |
| 10492.8 | 1 | < 0.1% |
| 11000 | 2 | < 0.1% |
| 11111 | 1 | < 0.1% |
| 11853 | 1 | < 0.1% |
| 12000 | 7 | |
| 12400 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1000000 | 1 | |
| 900009 | 1 | |
| 897000 | 1 | |
| 760000 | 1 | |
| 600000 | 2 | |
| 550000 | 1 | |
| 525000 | 1 | |
| 500000 | 2 | |
| 450000 | 1 | |
| 444000 | 1 |
loan_status
Categorical
IMBALANCE  MISSING 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Memory size | 78.2 KiB |
| Current | |
|---|---|
| Fully Paid | |
| Charged Off | 218 |
| Late (31-120 days) | 148 |
| In Grace Period | 48 |
| Other values (2) | 37 |
Length
| Max length | 18 |
|---|---|
| Median length | 7 |
| Mean length | 7.6244225 |
| Min length | 7 |
Characters and Unicode
| Total characters | 72615 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Current |
|---|---|
| 2nd row | Current |
| 3rd row | Late (31-120 days) |
| 4th row | Fully Paid |
| 5th row | Current |
Common Values
| Value | Count | Frequency (%) |
| Current | 8122 | |
| Fully Paid | 951 | 9.5% |
| Charged Off | 218 | 2.2% |
| Late (31-120 days) | 148 | 1.5% |
| In Grace Period | 48 | 0.5% |
| Late (16-30 days) | 21 | 0.2% |
| Default | 16 | 0.2% |
| (Missing) | 476 | 4.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| current | 8122 | |
| fully | 951 | 8.5% |
| paid | 951 | 8.5% |
| charged | 218 | 2.0% |
| off | 218 | 2.0% |
| late | 169 | 1.5% |
| days | 169 | 1.5% |
| 31-120 | 148 | 1.3% |
| in | 48 | 0.4% |
| grace | 48 | 0.4% |
| Other values (3) | 85 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 16558 | |
| u | 9089 | |
| e | 8621 | |
| C | 8340 | |
| t | 8307 | |
| n | 8170 | |
| l | 1918 | 2.6% |
| 1603 | 2.2% | |
| a | 1571 | 2.2% |
| d | 1386 | 1.9% |
| Other values (23) | 7052 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 58892 | |
| Uppercase Letter | 10789 | 14.9% |
| Space Separator | 1603 | 2.2% |
| Decimal Number | 824 | 1.1% |
| Open Punctuation | 169 | 0.2% |
| Dash Punctuation | 169 | 0.2% |
| Close Punctuation | 169 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 16558 | |
| u | 9089 | |
| e | 8621 | |
| t | 8307 | |
| n | 8170 | |
| l | 1918 | 3.3% |
| a | 1571 | 2.7% |
| d | 1386 | 2.4% |
| y | 1120 | 1.9% |
| i | 999 | 1.7% |
| Other values (6) | 1153 | 2.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 8340 | |
| P | 999 | 9.3% |
| F | 951 | 8.8% |
| O | 218 | 2.0% |
| L | 169 | 1.6% |
| I | 48 | 0.4% |
| G | 48 | 0.4% |
| D | 16 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 317 | |
| 3 | 169 | |
| 0 | 169 | |
| 2 | 148 | |
| 6 | 21 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| 1603 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 169 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 169 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 169 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 69681 | |
| Common | 2934 | 4.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 16558 | |
| u | 9089 | |
| e | 8621 | |
| C | 8340 | |
| t | 8307 | |
| n | 8170 | |
| l | 1918 | 2.8% |
| a | 1571 | 2.3% |
| d | 1386 | 2.0% |
| y | 1120 | 1.6% |
| Other values (14) | 4601 | 6.6% |
Common
| Value | Count | Frequency (%) |
| 1603 | ||
| 1 | 317 | 10.8% |
| ( | 169 | 5.8% |
| 3 | 169 | 5.8% |
| - | 169 | 5.8% |
| 0 | 169 | 5.8% |
| ) | 169 | 5.8% |
| 2 | 148 | 5.0% |
| 6 | 21 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 72615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 16558 | |
| u | 9089 | |
| e | 8621 | |
| C | 8340 | |
| t | 8307 | |
| n | 8170 | |
| l | 1918 | 2.6% |
| 1603 | 2.2% | |
| a | 1571 | 2.2% |
| d | 1386 | 1.9% |
| Other values (23) | 7052 |
purpose
Categorical
MISSING 
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Memory size | 78.2 KiB |
| debt_consolidation | |
|---|---|
| credit_card | |
| home_improvement | 497 |
| other | 431 |
| major_purchase | 189 |
| Other values (8) | 528 |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 15.064679 |
| Min length | 3 |
Characters and Unicode
| Total characters | 143476 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | credit_card |
|---|---|
| 2nd row | small_business |
| 3rd row | small_business |
| 4th row | debt_consolidation |
| 5th row | debt_consolidation |
Common Values
| Value | Count | Frequency (%) |
| debt_consolidation | 5665 | |
| credit_card | 2214 | 22.1% |
| home_improvement | 497 | 5.0% |
| other | 431 | 4.3% |
| major_purchase | 189 | 1.9% |
| small_business | 147 | 1.5% |
| car | 81 | 0.8% |
| medical | 72 | 0.7% |
| wedding | 61 | 0.6% |
| house | 55 | 0.5% |
| Other values (3) | 112 | 1.1% |
| (Missing) | 476 | 4.8% |
Length
| Value | Count | Frequency (%) |
| debt_consolidation | 5665 | |
| credit_card | 2214 | 23.2% |
| home_improvement | 497 | 5.2% |
| other | 431 | 4.5% |
| major_purchase | 189 | 2.0% |
| small_business | 147 | 1.5% |
| car | 81 | 0.9% |
| medical | 72 | 0.8% |
| wedding | 61 | 0.6% |
| house | 55 | 0.6% |
| Other values (3) | 112 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 18764 | |
| d | 15952 | |
| t | 14522 | |
| i | 14421 | |
| n | 12159 | |
| c | 10485 | |
| e | 10385 | |
| _ | 8724 | 6.1% |
| a | 8669 | 6.0% |
| s | 6497 | 4.5% |
| Other values (12) | 22898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 134752 | |
| Connector Punctuation | 8724 | 6.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 18764 | |
| d | 15952 | |
| t | 14522 | |
| i | 14421 | |
| n | 12159 | |
| c | 10485 | |
| e | 10385 | |
| a | 8669 | |
| s | 6497 | 4.8% |
| l | 6043 | 4.5% |
| Other values (11) | 16855 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8724 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 134752 | |
| Common | 8724 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 18764 | |
| d | 15952 | |
| t | 14522 | |
| i | 14421 | |
| n | 12159 | |
| c | 10485 | |
| e | 10385 | |
| a | 8669 | |
| s | 6497 | 4.8% |
| l | 6043 | 4.5% |
| Other values (11) | 16855 |
Common
| Value | Count | Frequency (%) |
| _ | 8724 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 143476 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 18764 | |
| d | 15952 | |
| t | 14522 | |
| i | 14421 | |
| n | 12159 | |
| c | 10485 | |
| e | 10385 | |
| _ | 8724 | 6.1% |
| a | 8669 | 6.0% |
| s | 6497 | 4.5% |
| Other values (12) | 22898 |
addr_state
Categorical
| Distinct | 45 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| CA | |
|---|---|
| NY | |
| TX | |
| FL | |
| NJ | 400 |
| Other values (40) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MA |
|---|---|
| 2nd row | WA |
| 3rd row | NY |
| 4th row | NJ |
| 5th row | VA |
Common Values
| Value | Count | Frequency (%) |
| CA | 1685 | |
| NY | 868 | 8.7% |
| TX | 783 | 7.8% |
| FL | 648 | 6.5% |
| NJ | 400 | 4.0% |
| PA | 367 | 3.7% |
| IL | 362 | 3.6% |
| VA | 317 | 3.2% |
| GA | 310 | 3.1% |
| NC | 287 | 2.9% |
| Other values (35) | 3973 |
Length
| Value | Count | Frequency (%) |
| ca | 1685 | |
| ny | 868 | 8.7% |
| tx | 783 | 7.8% |
| fl | 648 | 6.5% |
| nj | 400 | 4.0% |
| pa | 367 | 3.7% |
| il | 362 | 3.6% |
| va | 317 | 3.2% |
| ga | 310 | 3.1% |
| nc | 287 | 2.9% |
| Other values (35) | 3973 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 3685 | |
| C | 2474 | |
| N | 2211 | |
| L | 1257 | 6.3% |
| T | 1187 | 5.9% |
| M | 1013 | 5.1% |
| Y | 1009 | 5.0% |
| I | 954 | 4.8% |
| O | 841 | 4.2% |
| X | 783 | 3.9% |
| Other values (14) | 4586 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 20000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3685 | |
| C | 2474 | |
| N | 2211 | |
| L | 1257 | 6.3% |
| T | 1187 | 5.9% |
| M | 1013 | 5.1% |
| Y | 1009 | 5.0% |
| I | 954 | 4.8% |
| O | 841 | 4.2% |
| X | 783 | 3.9% |
| Other values (14) | 4586 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 3685 | |
| C | 2474 | |
| N | 2211 | |
| L | 1257 | 6.3% |
| T | 1187 | 5.9% |
| M | 1013 | 5.1% |
| Y | 1009 | 5.0% |
| I | 954 | 4.8% |
| O | 841 | 4.2% |
| X | 783 | 3.9% |
| Other values (14) | 4586 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 3685 | |
| C | 2474 | |
| N | 2211 | |
| L | 1257 | 6.3% |
| T | 1187 | 5.9% |
| M | 1013 | 5.1% |
| Y | 1009 | 5.0% |
| I | 954 | 4.8% |
| O | 841 | 4.2% |
| X | 783 | 3.9% |
| Other values (14) | 4586 |
dti
Real number (ℝ)
MISSING 
| Distinct | 2955 |
|---|---|
| Distinct (%) | 31.0% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.146927 |
| Minimum | 0 |
|---|---|
| Maximum | 34.98 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5.1 |
| Q1 | 11.52 |
| median | 16.84 |
| Q3 | 22.59 |
| 95-th percentile | 30.0885 |
| Maximum | 34.98 |
| Range | 34.98 |
| Interquartile range (IQR) | 11.07 |
Descriptive statistics
| Standard deviation | 7.5916009 |
|---|---|
| Coefficient of variation (CV) | 0.44273828 |
| Kurtosis | -0.64702699 |
| Mean | 17.146927 |
| Median Absolute Deviation (MAD) | 5.53 |
| Skewness | 0.13199079 |
| Sum | 163307.33 |
| Variance | 57.632404 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20.37 | 12 | 0.1% |
| 14.4 | 11 | 0.1% |
| 17.58 | 11 | 0.1% |
| 16.29 | 11 | 0.1% |
| 12.63 | 11 | 0.1% |
| 14.74 | 11 | 0.1% |
| 17.32 | 11 | 0.1% |
| 13.14 | 10 | 0.1% |
| 11.6 | 10 | 0.1% |
| 11.35 | 10 | 0.1% |
| Other values (2945) | 9416 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 0.01 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.15 | 1 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.25 | 3 | |
| 0.26 | 1 | < 0.1% |
| 0.41 | 1 | < 0.1% |
| 0.45 | 2 | |
| 0.47 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 34.98 | 2 | |
| 34.97 | 1 | < 0.1% |
| 34.96 | 4 | |
| 34.95 | 1 | < 0.1% |
| 34.9 | 2 | |
| 34.88 | 1 | < 0.1% |
| 34.86 | 1 | < 0.1% |
| 34.85 | 1 | < 0.1% |
| 34.84 | 1 | < 0.1% |
| 34.8 | 1 | < 0.1% |
delinq_2yrs
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.23876522 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 8025 |
| Zeros (%) | 80.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.69145547 |
|---|---|
| Coefficient of variation (CV) | 2.8959639 |
| Kurtosis | 39.205653 |
| Mean | 0.23876522 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.9444618 |
| Sum | 2274 |
| Variance | 0.47811067 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8025 | |
| 1 | 1027 | 10.3% |
| 2 | 316 | 3.2% |
| 3 | 90 | 0.9% |
| 4 | 33 | 0.3% |
| 5 | 12 | 0.1% |
| 6 | 10 | 0.1% |
| 8 | 4 | < 0.1% |
| 7 | 3 | < 0.1% |
| 9 | 2 | < 0.1% |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 0 | 8025 | |
| 1 | 1027 | 10.3% |
| 2 | 316 | 3.2% |
| 3 | 90 | 0.9% |
| 4 | 33 | 0.3% |
| 5 | 12 | 0.1% |
| 6 | 10 | 0.1% |
| 7 | 3 | < 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 4 | < 0.1% |
| 7 | 3 | < 0.1% |
| 6 | 10 | 0.1% |
| 5 | 12 | 0.1% |
| 4 | 33 | 0.3% |
| 3 | 90 | 0.9% |
| 2 | 316 | 3.2% |
| 1 | 1027 |
earliest_cr_line
Date
MISSING 
| Distinct | 9397 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Memory size | 78.2 KiB |
| Minimum | 1970-01-12 12:47:00 |
|---|---|
| Maximum | 2069-12-27 12:00:00 |
mths_since_last_delinq
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 87 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 5900 |
| Missing (%) | 59.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.988537 |
| Minimum | 0 |
|---|---|
| Maximum | 122 |
| Zeros | 5 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 17 |
| median | 32 |
| Q3 | 49 |
| 95-th percentile | 75 |
| Maximum | 122 |
| Range | 122 |
| Interquartile range (IQR) | 32 |
Descriptive statistics
| Standard deviation | 21.474509 |
|---|---|
| Coefficient of variation (CV) | 0.61375842 |
| Kurtosis | -0.76340332 |
| Mean | 34.988537 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 0.4635379 |
| Sum | 143453 |
| Variance | 461.15454 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 93 | 0.9% |
| 8 | 89 | 0.9% |
| 17 | 84 | 0.8% |
| 14 | 84 | 0.8% |
| 10 | 84 | 0.8% |
| 12 | 82 | 0.8% |
| 36 | 81 | 0.8% |
| 15 | 80 | 0.8% |
| 11 | 79 | 0.8% |
| 7 | 78 | 0.8% |
| Other values (77) | 3266 | |
| (Missing) | 5900 |
| Value | Count | Frequency (%) |
| 0 | 5 | 0.1% |
| 1 | 26 | 0.3% |
| 2 | 27 | 0.3% |
| 3 | 21 | 0.2% |
| 4 | 32 | 0.3% |
| 5 | 42 | |
| 6 | 50 | |
| 7 | 78 | |
| 8 | 89 | |
| 9 | 76 |
| Value | Count | Frequency (%) |
| 122 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 86 | 1 | < 0.1% |
| 83 | 4 | < 0.1% |
| 82 | 7 | 0.1% |
| 81 | 32 | |
| 80 | 37 | |
| 79 | 23 | |
| 78 | 25 | |
| 77 | 25 |
open_acc
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 36 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.043784 |
| Minimum | 1 |
|---|---|
| Maximum | 39 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 10 |
| Q3 | 14 |
| 95-th percentile | 20 |
| Maximum | 39 |
| Range | 38 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.561028 |
|---|---|
| Coefficient of variation (CV) | 0.41299503 |
| Kurtosis | 1.3156019 |
| Mean | 11.043784 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.93522306 |
| Sum | 105181 |
| Variance | 20.802976 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 948 | |
| 10 | 902 | |
| 8 | 892 | |
| 11 | 847 | 8.5% |
| 7 | 801 | 8.0% |
| 12 | 798 | 8.0% |
| 6 | 628 | 6.3% |
| 13 | 600 | 6.0% |
| 14 | 488 | 4.9% |
| 15 | 439 | 4.4% |
| Other values (26) | 2181 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 1 | 2 | < 0.1% |
| 2 | 29 | 0.3% |
| 3 | 81 | 0.8% |
| 4 | 202 | 2.0% |
| 5 | 404 | |
| 6 | 628 | |
| 7 | 801 | |
| 8 | 892 | |
| 9 | 948 | |
| 10 | 902 |
| Value | Count | Frequency (%) |
| 39 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 32 | 3 | < 0.1% |
| 31 | 6 | 0.1% |
| 30 | 3 | < 0.1% |
| 29 | 4 | < 0.1% |
| 28 | 9 | |
| 27 | 16 |
revol_bal
Real number (ℝ)
MISSING 
| Distinct | 8180 |
|---|---|
| Distinct (%) | 85.9% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15982.998 |
| Minimum | 0 |
|---|---|
| Maximum | 376679 |
| Zeros | 25 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2646.15 |
| Q1 | 7151 |
| median | 12495 |
| Q3 | 20596 |
| 95-th percentile | 38079.85 |
| Maximum | 376679 |
| Range | 376679 |
| Interquartile range (IQR) | 13445 |
Descriptive statistics
| Standard deviation | 15177.648 |
|---|---|
| Coefficient of variation (CV) | 0.94961208 |
| Kurtosis | 72.197975 |
| Mean | 15982.998 |
| Median Absolute Deviation (MAD) | 6204 |
| Skewness | 5.6083162 |
| Sum | 1.5222208 × 108 |
| Variance | 2.30361 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 25 | 0.2% |
| 7201 | 5 | 0.1% |
| 15883 | 4 | < 0.1% |
| 14631 | 4 | < 0.1% |
| 5410 | 4 | < 0.1% |
| 7719 | 4 | < 0.1% |
| 13651 | 4 | < 0.1% |
| 6150 | 4 | < 0.1% |
| 11446 | 4 | < 0.1% |
| 5220 | 4 | < 0.1% |
| Other values (8170) | 9462 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 0 | 25 | |
| 1 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 19 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 376679 | 1 | |
| 262741 | 1 | |
| 245619 | 1 | |
| 225925 | 1 | |
| 212032 | 1 | |
| 201757 | 1 | |
| 199713 | 1 | |
| 195540 | 1 | |
| 186291 | 1 | |
| 178858 | 1 |
total_acc
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 64 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.51764 |
| Minimum | 3 |
|---|---|
| Maximum | 68 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 17 |
| median | 23 |
| Q3 | 31 |
| 95-th percentile | 45 |
| Maximum | 68 |
| Range | 65 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 10.887693 |
|---|---|
| Coefficient of variation (CV) | 0.44407589 |
| Kurtosis | 0.43208523 |
| Mean | 24.51764 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.7221529 |
| Sum | 233506 |
| Variance | 118.54185 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 402 | 4.0% |
| 19 | 380 | 3.8% |
| 21 | 376 | 3.8% |
| 18 | 373 | 3.7% |
| 22 | 351 | 3.5% |
| 25 | 341 | 3.4% |
| 16 | 337 | 3.4% |
| 23 | 337 | 3.4% |
| 17 | 332 | 3.3% |
| 15 | 332 | 3.3% |
| Other values (54) | 5963 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 3 | 13 | 0.1% |
| 4 | 18 | 0.2% |
| 5 | 37 | 0.4% |
| 6 | 72 | 0.7% |
| 7 | 95 | 0.9% |
| 8 | 109 | |
| 9 | 159 | |
| 10 | 191 | |
| 11 | 209 | |
| 12 | 246 |
| Value | Count | Frequency (%) |
| 68 | 1 | < 0.1% |
| 67 | 1 | < 0.1% |
| 65 | 1 | < 0.1% |
| 63 | 24 | |
| 62 | 8 | 0.1% |
| 61 | 6 | 0.1% |
| 60 | 11 | |
| 59 | 10 | |
| 58 | 6 | 0.1% |
| 57 | 16 |
out_prncp
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 8224 |
|---|---|
| Distinct (%) | 86.4% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10253.674 |
| Minimum | 0 |
|---|---|
| Maximum | 34413.52 |
| Zeros | 1169 |
| Zeros (%) | 11.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 4273.3875 |
| median | 8745.425 |
| Q3 | 15055.438 |
| 95-th percentile | 25997.232 |
| Maximum | 34413.52 |
| Range | 34413.52 |
| Interquartile range (IQR) | 10782.05 |
Descriptive statistics
| Standard deviation | 7963.3 |
|---|---|
| Coefficient of variation (CV) | 0.77662893 |
| Kurtosis | 0.10410446 |
| Mean | 10253.674 |
| Median Absolute Deviation (MAD) | 5238.94 |
| Skewness | 0.78593003 |
| Sum | 97655993 |
| Variance | 63414148 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1169 | 11.7% |
| 9296.47 | 4 | < 0.1% |
| 11155.76 | 4 | < 0.1% |
| 9533.31 | 3 | < 0.1% |
| 14289.49 | 3 | < 0.1% |
| 7155.53 | 3 | < 0.1% |
| 11104.66 | 3 | < 0.1% |
| 32542.4 | 3 | < 0.1% |
| 11176.13 | 3 | < 0.1% |
| 18140.92 | 3 | < 0.1% |
| Other values (8214) | 8326 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 0 | 1169 | |
| 92.92 | 1 | < 0.1% |
| 197.25 | 1 | < 0.1% |
| 197.48 | 1 | < 0.1% |
| 231.47 | 1 | < 0.1% |
| 322.5 | 1 | < 0.1% |
| 348.14 | 1 | < 0.1% |
| 391.67 | 1 | < 0.1% |
| 446.86 | 1 | < 0.1% |
| 495.27 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 34413.52 | 1 | |
| 34411.69 | 1 | |
| 34381.72 | 1 | |
| 34374.93 | 1 | |
| 34334.69 | 1 | |
| 34326.1 | 1 | |
| 34318.96 | 1 | |
| 34315.92 | 1 | |
| 34306.82 | 1 | |
| 34291.79 | 1 |
total_pymnt
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 9282 |
|---|---|
| Distinct (%) | 97.5% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5225.2409 |
| Minimum | 34.14 |
|---|---|
| Maximum | 44231.08 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 34.14 |
|---|---|
| 5-th percentile | 566.269 |
| Q1 | 1676.3125 |
| median | 3500.04 |
| Q3 | 6736.965 |
| 95-th percentile | 16303.613 |
| Maximum | 44231.08 |
| Range | 44196.94 |
| Interquartile range (IQR) | 5060.6525 |
Descriptive statistics
| Standard deviation | 5499.4787 |
|---|---|
| Coefficient of variation (CV) | 1.0524833 |
| Kurtosis | 8.8775758 |
| Mean | 5225.2409 |
| Median Absolute Deviation (MAD) | 2179.975 |
| Skewness | 2.552875 |
| Sum | 49765195 |
| Variance | 30244265 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1195.56 | 4 | < 0.1% |
| 996.3 | 4 | < 0.1% |
| 664.2 | 4 | < 0.1% |
| 6298.27 | 3 | < 0.1% |
| 1964.04 | 3 | < 0.1% |
| 7873.32 | 3 | < 0.1% |
| 3057.6 | 3 | < 0.1% |
| 1992.6 | 3 | < 0.1% |
| 2591.16 | 3 | < 0.1% |
| 1441.12 | 3 | < 0.1% |
| Other values (9272) | 9491 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 34.14 | 1 | |
| 42.46 | 1 | |
| 46.62 | 1 | |
| 73.42 | 1 | |
| 73.94 | 1 | |
| 74.48 | 1 | |
| 79.68 | 1 | |
| 85.39 | 1 | |
| 94.81 | 1 | |
| 95.32 | 1 |
| Value | Count | Frequency (%) |
| 44231.08 | 1 | |
| 44121.01 | 1 | |
| 43391.6 | 1 | |
| 43004.93 | 1 | |
| 42651.38 | 1 | |
| 40986.76 | 1 | |
| 40929.44 | 1 | |
| 40889.22 | 1 | |
| 40153.71 | 1 | |
| 39981.69 | 1 |
total_rec_prncp
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 8644 |
|---|---|
| Distinct (%) | 90.8% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3808.5013 |
| Minimum | 22.5 |
|---|---|
| Maximum | 35000.01 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 22.5 |
|---|---|
| 5-th percentile | 337.7345 |
| Q1 | 1027.525 |
| median | 2237.87 |
| Q3 | 4544.47 |
| 95-th percentile | 13597.027 |
| Maximum | 35000.01 |
| Range | 34977.51 |
| Interquartile range (IQR) | 3516.945 |
Descriptive statistics
| Standard deviation | 4801.5012 |
|---|---|
| Coefficient of variation (CV) | 1.2607325 |
| Kurtosis | 11.95299 |
| Mean | 3808.5013 |
| Median Absolute Deviation (MAD) | 1435.875 |
| Skewness | 3.0714179 |
| Sum | 36272166 |
| Variance | 23054414 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12000 | 65 | 0.7% |
| 10000 | 50 | 0.5% |
| 6000 | 46 | 0.5% |
| 20000 | 41 | 0.4% |
| 15000 | 39 | 0.4% |
| 16000 | 33 | 0.3% |
| 5000 | 31 | 0.3% |
| 8000 | 30 | 0.3% |
| 18000 | 25 | 0.2% |
| 35000 | 24 | 0.2% |
| Other values (8634) | 9140 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 22.5 | 1 | |
| 23.27 | 1 | |
| 26.75 | 1 | |
| 27.17 | 1 | |
| 41.18 | 1 | |
| 41.8 | 1 | |
| 45.87 | 1 | |
| 50.06 | 1 | |
| 50.39 | 1 | |
| 55.99 | 1 |
| Value | Count | Frequency (%) |
| 35000.01 | 1 | < 0.1% |
| 35000 | 24 | |
| 34975 | 1 | < 0.1% |
| 34677.5 | 1 | < 0.1% |
| 34350 | 1 | < 0.1% |
| 34000 | 1 | < 0.1% |
| 33950 | 3 | < 0.1% |
| 33600 | 1 | < 0.1% |
| 33425 | 1 | < 0.1% |
| 33075 | 1 | < 0.1% |
total_rec_int
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 9229 |
|---|---|
| Distinct (%) | 96.9% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1412.894 |
| Minimum | 11.64 |
|---|---|
| Maximum | 13514.55 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 11.64 |
|---|---|
| 5-th percentile | 149.296 |
| Q1 | 468.1125 |
| median | 947 |
| Q3 | 1777.87 |
| 95-th percentile | 4406.5475 |
| Maximum | 13514.55 |
| Range | 13502.91 |
| Interquartile range (IQR) | 1309.7575 |
Descriptive statistics
| Standard deviation | 1489.2275 |
|---|---|
| Coefficient of variation (CV) | 1.0540264 |
| Kurtosis | 9.5744922 |
| Mean | 1412.894 |
| Median Absolute Deviation (MAD) | 574.47 |
| Skewness | 2.5963065 |
| Sum | 13456402 |
| Variance | 2217798.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 292.77 | 4 | < 0.1% |
| 351.32 | 4 | < 0.1% |
| 400.79 | 4 | < 0.1% |
| 231.13 | 3 | < 0.1% |
| 134.13 | 3 | < 0.1% |
| 986.41 | 3 | < 0.1% |
| 1453.8 | 3 | < 0.1% |
| 475.65 | 3 | < 0.1% |
| 4191.21 | 3 | < 0.1% |
| 195.36 | 3 | < 0.1% |
| Other values (9219) | 9491 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 11.64 | 1 | |
| 12.45 | 1 | |
| 13.04 | 1 | |
| 13.4 | 1 | |
| 15.29 | 1 | |
| 15.33 | 1 | |
| 17.62 | 1 | |
| 18.7 | 1 | |
| 19.36 | 1 | |
| 19.71 | 1 |
| Value | Count | Frequency (%) |
| 13514.55 | 1 | |
| 13331.14 | 1 | |
| 13298.51 | 1 | |
| 12643.97 | 1 | |
| 12617.02 | 1 | |
| 12156.87 | 1 | |
| 11918.17 | 1 | |
| 11687.98 | 1 | |
| 11549.15 | 1 | |
| 11506.46 | 1 |
wtd_loans
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 10000 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 78.2 KiB |
interest_rate
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 10000 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 78.2 KiB |
int_rate2
Text
MISSING 
| Distinct | 134 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Memory size | 78.2 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.8189836 |
| Min length | 5 |
Characters and Unicode
| Total characters | 55420 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 10.16% |
|---|---|
| 2nd row | 8.90% |
| 3rd row | 7.90% |
| 4th row | 13.67% |
| 5th row | 15.80% |
| Value | Count | Frequency (%) |
| 12.12 | 485 | 5.1% |
| 13.11 | 432 | 4.5% |
| 8.90 | 357 | 3.7% |
| 14.33 | 351 | 3.7% |
| 7.90 | 321 | 3.4% |
| 11.14 | 318 | 3.3% |
| 15.31 | 285 | 3.0% |
| 16.29 | 265 | 2.8% |
| 7.62 | 262 | 2.8% |
| 10.16 | 223 | 2.3% |
| Other values (124) | 6225 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 10517 | |
| . | 9524 | |
| % | 9524 | |
| 2 | 4315 | |
| 9 | 3785 | 6.8% |
| 0 | 3313 | 6.0% |
| 7 | 2926 | 5.3% |
| 3 | 2654 | 4.8% |
| 6 | 2557 | 4.6% |
| 5 | 2514 | 4.5% |
| Other values (2) | 3791 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 36372 | |
| Other Punctuation | 19048 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 10517 | |
| 2 | 4315 | |
| 9 | 3785 | 10.4% |
| 0 | 3313 | 9.1% |
| 7 | 2926 | 8.0% |
| 3 | 2654 | 7.3% |
| 6 | 2557 | 7.0% |
| 5 | 2514 | 6.9% |
| 8 | 1910 | 5.3% |
| 4 | 1881 | 5.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9524 | |
| % | 9524 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 55420 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 10517 | |
| . | 9524 | |
| % | 9524 | |
| 2 | 4315 | |
| 9 | 3785 | 6.8% |
| 0 | 3313 | 6.0% |
| 7 | 2926 | 5.3% |
| 3 | 2654 | 4.8% |
| 6 | 2557 | 4.6% |
| 5 | 2514 | 4.5% |
| Other values (2) | 3791 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55420 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 10517 | |
| . | 9524 | |
| % | 9524 | |
| 2 | 4315 | |
| 9 | 3785 | 6.8% |
| 0 | 3313 | 6.0% |
| 7 | 2926 | 5.3% |
| 3 | 2654 | 4.8% |
| 6 | 2557 | 4.6% |
| 5 | 2514 | 4.5% |
| Other values (2) | 3791 | 6.8% |
num_rate
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 10000 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 78.2 KiB |
numrate
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 10000 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 78.2 KiB |
int_rate3
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 134 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 476 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.277852 |
| Minimum | 6.03 |
|---|---|
| Maximum | 26.06 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 6.03 |
|---|---|
| 5-th percentile | 7.62 |
| Q1 | 11.14 |
| median | 14.09 |
| Q3 | 17.27 |
| 95-th percentile | 22.4 |
| Maximum | 26.06 |
| Range | 20.03 |
| Interquartile range (IQR) | 6.13 |
Descriptive statistics
| Standard deviation | 4.4301591 |
|---|---|
| Coefficient of variation (CV) | 0.31028191 |
| Kurtosis | -0.46512934 |
| Mean | 14.277852 |
| Median Absolute Deviation (MAD) | 3.1 |
| Skewness | 0.24772703 |
| Sum | 135982.26 |
| Variance | 19.62631 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.12 | 485 | 4.9% |
| 13.11 | 432 | 4.3% |
| 8.9 | 357 | 3.6% |
| 14.33 | 351 | 3.5% |
| 7.9 | 321 | 3.2% |
| 11.14 | 318 | 3.2% |
| 15.31 | 285 | 2.9% |
| 16.29 | 265 | 2.6% |
| 7.62 | 262 | 2.6% |
| 10.16 | 223 | 2.2% |
| Other values (124) | 6225 | |
| (Missing) | 476 | 4.8% |
| Value | Count | Frequency (%) |
| 6.03 | 220 | |
| 6.62 | 184 | |
| 6.97 | 15 | 0.1% |
| 7.51 | 12 | 0.1% |
| 7.62 | 262 | |
| 7.9 | 321 | |
| 8.6 | 23 | 0.2% |
| 8.9 | 357 | |
| 9.25 | 26 | 0.3% |
| 9.67 | 75 | 0.8% |
| Value | Count | Frequency (%) |
| 26.06 | 6 | 0.1% |
| 25.99 | 3 | < 0.1% |
| 25.89 | 8 | |
| 25.83 | 9 | |
| 25.8 | 9 | |
| 25.57 | 8 | |
| 25.28 | 5 | 0.1% |
| 24.99 | 11 | |
| 24.89 | 15 | |
| 24.83 | 7 |
| id | loan_amnt | funded_amnt | int_rate | installment | annual_inc | dti | delinq_2yrs | mths_since_last_delinq | open_acc | revol_bal | total_acc | out_prncp | total_pymnt | total_rec_prncp | total_rec_int | int_rate3 | term | emp_length | home_ownership | loan_status | purpose | addr_state | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| id | 1.000 | 0.000 | 0.001 | 0.075 | 0.022 | 0.039 | 0.027 | 0.067 | -0.048 | 0.052 | 0.005 | 0.070 | 0.364 | -0.689 | -0.708 | -0.504 | 0.075 | 0.096 | 0.025 | 0.040 | 0.125 | 0.051 | 0.033 |
| loan_amnt | 0.000 | 1.000 | 1.000 | 0.143 | 0.970 | 0.480 | 0.048 | 0.029 | -0.053 | 0.206 | 0.500 | 0.258 | 0.751 | 0.521 | 0.396 | 0.674 | 0.143 | 0.459 | 0.043 | 0.101 | 0.021 | 0.103 | 0.024 |
| funded_amnt | 0.001 | 1.000 | 1.000 | 0.143 | 0.970 | 0.480 | 0.048 | 0.028 | -0.053 | 0.206 | 0.500 | 0.258 | 0.752 | 0.521 | 0.396 | 0.674 | 0.143 | 0.458 | 0.043 | 0.101 | 0.021 | 0.103 | 0.024 |
| int_rate | 0.075 | 0.143 | 0.143 | 1.000 | 0.139 | -0.030 | 0.142 | 0.115 | -0.082 | -0.001 | 0.011 | -0.025 | 0.144 | 0.027 | -0.150 | 0.364 | 1.000 | 0.493 | 0.000 | 0.058 | 0.043 | 0.089 | 0.028 |
| installment | 0.022 | 0.970 | 0.970 | 0.139 | 1.000 | 0.472 | 0.048 | 0.038 | -0.057 | 0.204 | 0.492 | 0.242 | 0.707 | 0.549 | 0.447 | 0.667 | 0.139 | 0.298 | 0.042 | 0.086 | 0.019 | 0.098 | 0.026 |
| annual_inc | 0.039 | 0.480 | 0.480 | -0.030 | 0.472 | 1.000 | -0.227 | 0.099 | -0.089 | 0.232 | 0.408 | 0.342 | 0.361 | 0.251 | 0.212 | 0.263 | -0.030 | 0.032 | 0.005 | 0.065 | 0.000 | 0.045 | 0.012 |
| dti | 0.027 | 0.048 | 0.048 | 0.142 | 0.048 | -0.227 | 1.000 | -0.023 | 0.053 | 0.297 | 0.234 | 0.229 | 0.082 | -0.010 | -0.042 | 0.078 | 0.142 | 0.076 | 0.019 | 0.037 | 0.034 | 0.061 | 0.045 |
| delinq_2yrs | 0.067 | 0.029 | 0.028 | 0.115 | 0.038 | 0.099 | -0.023 | 1.000 | -0.823 | 0.060 | -0.041 | 0.165 | 0.031 | -0.020 | -0.035 | 0.012 | 0.115 | 0.005 | 0.007 | 0.000 | 0.014 | 0.013 | 0.037 |
| mths_since_last_delinq | -0.048 | -0.053 | -0.053 | -0.082 | -0.057 | -0.089 | 0.053 | -0.823 | 1.000 | -0.051 | -0.013 | -0.093 | -0.041 | -0.005 | 0.009 | -0.015 | -0.082 | 0.036 | 0.022 | 0.030 | 0.000 | 0.000 | 0.000 |
| open_acc | 0.052 | 0.206 | 0.206 | -0.001 | 0.204 | 0.232 | 0.297 | 0.060 | -0.051 | 1.000 | 0.348 | 0.663 | 0.184 | 0.080 | 0.061 | 0.110 | -0.001 | 0.073 | 0.012 | 0.082 | 0.008 | 0.027 | 0.011 |
| revol_bal | 0.005 | 0.500 | 0.500 | 0.011 | 0.492 | 0.408 | 0.234 | -0.041 | -0.013 | 0.348 | 1.000 | 0.313 | 0.389 | 0.277 | 0.230 | 0.327 | 0.011 | 0.041 | 0.024 | 0.066 | 0.000 | 0.027 | 0.033 |
| total_acc | 0.070 | 0.258 | 0.258 | -0.025 | 0.242 | 0.342 | 0.229 | 0.165 | -0.093 | 0.663 | 0.313 | 1.000 | 0.201 | 0.101 | 0.077 | 0.112 | -0.025 | 0.124 | 0.049 | 0.127 | 0.006 | 0.032 | 0.047 |
| out_prncp | 0.364 | 0.751 | 0.752 | 0.144 | 0.707 | 0.361 | 0.082 | 0.031 | -0.041 | 0.184 | 0.389 | 0.201 | 1.000 | 0.041 | -0.082 | 0.428 | 0.144 | 0.506 | 0.044 | 0.099 | 0.298 | 0.076 | 0.032 |
| total_pymnt | -0.689 | 0.521 | 0.521 | 0.027 | 0.549 | 0.251 | -0.010 | -0.020 | -0.005 | 0.080 | 0.277 | 0.101 | 0.041 | 1.000 | 0.968 | 0.768 | 0.027 | 0.110 | 0.007 | 0.021 | 0.233 | 0.044 | 0.025 |
| total_rec_prncp | -0.708 | 0.396 | 0.396 | -0.150 | 0.447 | 0.212 | -0.042 | -0.035 | 0.009 | 0.061 | 0.230 | 0.077 | -0.082 | 0.968 | 1.000 | 0.627 | -0.150 | 0.168 | 0.004 | 0.017 | 0.277 | 0.036 | 0.012 |
| total_rec_int | -0.504 | 0.674 | 0.674 | 0.364 | 0.667 | 0.263 | 0.078 | 0.012 | -0.015 | 0.110 | 0.327 | 0.112 | 0.428 | 0.768 | 0.627 | 1.000 | 0.364 | 0.415 | 0.013 | 0.029 | 0.042 | 0.036 | 0.029 |
| int_rate3 | 0.075 | 0.143 | 0.143 | 1.000 | 0.139 | -0.030 | 0.142 | 0.115 | -0.082 | -0.001 | 0.011 | -0.025 | 0.144 | 0.027 | -0.150 | 0.364 | 1.000 | 0.493 | 0.000 | 0.058 | 0.043 | 0.089 | 0.028 |
| term | 0.096 | 0.459 | 0.458 | 0.493 | 0.298 | 0.032 | 0.076 | 0.005 | 0.036 | 0.073 | 0.041 | 0.124 | 0.506 | 0.110 | 0.168 | 0.415 | 0.493 | 1.000 | 0.086 | 0.121 | 0.065 | 0.090 | 0.058 |
| emp_length | 0.025 | 0.043 | 0.043 | 0.000 | 0.042 | 0.005 | 0.019 | 0.007 | 0.022 | 0.012 | 0.024 | 0.049 | 0.044 | 0.007 | 0.004 | 0.013 | 0.000 | 0.086 | 1.000 | 0.107 | 0.000 | 0.018 | 0.016 |
| home_ownership | 0.040 | 0.101 | 0.101 | 0.058 | 0.086 | 0.065 | 0.037 | 0.000 | 0.030 | 0.082 | 0.066 | 0.127 | 0.099 | 0.021 | 0.017 | 0.029 | 0.058 | 0.121 | 0.107 | 1.000 | 0.000 | 0.087 | 0.135 |
| loan_status | 0.125 | 0.021 | 0.021 | 0.043 | 0.019 | 0.000 | 0.034 | 0.014 | 0.000 | 0.008 | 0.000 | 0.006 | 0.298 | 0.233 | 0.277 | 0.042 | 0.043 | 0.065 | 0.000 | 0.000 | 1.000 | 0.032 | 0.023 |
| purpose | 0.051 | 0.103 | 0.103 | 0.089 | 0.098 | 0.045 | 0.061 | 0.013 | 0.000 | 0.027 | 0.027 | 0.032 | 0.076 | 0.044 | 0.036 | 0.036 | 0.089 | 0.090 | 0.018 | 0.087 | 0.032 | 1.000 | 0.027 |
| addr_state | 0.033 | 0.024 | 0.024 | 0.028 | 0.026 | 0.012 | 0.045 | 0.037 | 0.000 | 0.011 | 0.033 | 0.047 | 0.032 | 0.025 | 0.012 | 0.029 | 0.028 | 0.058 | 0.016 | 0.135 | 0.023 | 0.027 | 1.000 |
| id | loan_amnt | funded_amnt | term | int_rate | installment | emp_length | home_ownership | annual_inc | loan_status | purpose | addr_state | dti | delinq_2yrs | earliest_cr_line | mths_since_last_delinq | open_acc | revol_bal | total_acc | out_prncp | total_pymnt | total_rec_prncp | total_rec_int | wtd_loans | interest_rate | int_rate2 | num_rate | numrate | int_rate3 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 571203 | 18000 | 18000 | 60 months | 10.16 | 383.87 | 10+ years | MORTGAGE | 72804.0 | Current | credit_card | MA | 16.73 | 0.0 | 1995-12-27 02:06:00 | NaN | 21.0 | 8751.0 | 49.0 | 13263.18 | 7273.77 | 4736.82 | 2536.95 | NaN | NaN | 10.16% | NaN | NaN | 10.16 |
| 1 | 694891 | 15675 | 15675 | 36 months | 8.90 | 497.74 | 10+ years | MORTGAGE | 100000.0 | Current | small_business | WA | 9.10 | 0.0 | 1994-04-07 12:00:00 | NaN | 16.0 | 20650.0 | 45.0 | 15294.25 | 496.78 | 380.75 | 116.03 | NaN | NaN | 8.90% | NaN | NaN | 8.90 |
| 2 | 784712 | 16500 | 16500 | 60 months | 7.90 | 333.78 | 2 years | MORTGAGE | 42000.0 | Late (31-120 days) | small_business | NY | 10.43 | 0.0 | 1993-07-16 08:41:00 | NaN | 9.0 | 2229.0 | 17.0 | 12966.64 | 5000.85 | 3533.36 | 1467.49 | NaN | NaN | 7.90% | NaN | NaN | 7.90 |
| 3 | 843448 | 5500 | 5500 | 36 months | 13.67 | 187.10 | 3 years | RENT | 55000.0 | Fully Paid | debt_consolidation | NJ | 20.71 | 0.0 | 1987-07-24 12:40:00 | NaN | 17.0 | 9486.0 | 25.0 | 0.00 | 5792.14 | 5500.00 | 292.14 | NaN | NaN | 13.67% | NaN | NaN | 13.67 |
| 4 | 974654 | 6400 | 6400 | 36 months | 15.80 | 224.38 | 2 years | RENT | 34000.0 | Current | debt_consolidation | VA | 32.40 | 0.0 | 1998-03-15 06:57:00 | 47.0 | 6.0 | 4915.0 | 15.0 | 4430.59 | 2912.26 | 1969.41 | 942.85 | NaN | NaN | 15.80% | NaN | NaN | 15.80 |
| 5 | 1023119 | 1400 | 1400 | 36 months | 15.96 | 49.20 | 3 years | MORTGAGE | 67000.0 | Fully Paid | home_improvement | NV | 19.57 | 0.0 | 2004-01-01 12:16:00 | 61.0 | 8.0 | 13806.0 | 14.0 | 0.00 | 1687.48 | 1400.00 | 287.48 | NaN | NaN | 15.96% | NaN | NaN | 15.96 |
| 6 | 1042871 | 6250 | 6250 | 36 months | 7.51 | 194.45 | 7 years | MORTGAGE | 33600.0 | Current | debt_consolidation | CA | 18.05 | 0.0 | 2005-08-13 06:31:00 | NaN | 7.0 | 5174.0 | 10.0 | 2072.55 | 4847.25 | 4177.45 | 669.80 | NaN | NaN | 7.51% | NaN | NaN | 7.51 |
| 7 | 1055193 | 7300 | 7300 | 36 months | 13.49 | 247.70 | 3 years | RENT | 50000.0 | Current | small_business | FL | 19.06 | 0.0 | 2006-01-03 09:58:00 | NaN | 8.0 | 12026.0 | 13.0 | 2554.60 | 6185.50 | 4745.40 | 1440.10 | NaN | NaN | 13.49% | NaN | NaN | 13.49 |
| 8 | 1059509 | 20000 | 20000 | 60 months | 17.27 | 499.96 | 10+ years | MORTGAGE | 80000.0 | Current | debt_consolidation | VA | 15.06 | 0.0 | 2001-03-04 05:02:00 | NaN | 11.0 | 21592.0 | 30.0 | 13404.04 | 12928.21 | 6595.96 | 6332.25 | NaN | NaN | 17.27% | NaN | NaN | 17.27 |
| 9 | 1063649 | 17500 | 16800 | 60 months | 22.74 | 471.10 | 6 years | MORTGAGE | 95000.0 | Charged Off | debt_consolidation | WA | 24.78 | 0.0 | 2002-01-30 07:50:00 | NaN | 12.0 | 23722.0 | 23.0 | 0.00 | 4704.90 | 1662.30 | 3042.60 | NaN | NaN | 22.74% | NaN | NaN | 22.74 |
| id | loan_amnt | funded_amnt | term | int_rate | installment | emp_length | home_ownership | annual_inc | loan_status | purpose | addr_state | dti | delinq_2yrs | earliest_cr_line | mths_since_last_delinq | open_acc | revol_bal | total_acc | out_prncp | total_pymnt | total_rec_prncp | total_rec_int | wtd_loans | interest_rate | int_rate2 | num_rate | numrate | int_rate3 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | 10075898 | 2000 | 2000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NJ | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9991 | 10090475 | 4000 | 4000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | VA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9992 | 10092119 | 8000 | 8000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | GA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9993 | 10092861 | 10625 | 10625 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | TN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9994 | 10105197 | 6500 | 6500 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | MO | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9995 | 10105778 | 10000 | 10000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | KY | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9996 | 10109949 | 15000 | 15000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9997 | 10112187 | 3500 | 3500 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NY | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9998 | 10119897 | 10000 | 10000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9999 | 10123100 | 4000 | 4000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | GA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |